Everything about The Human Genome totally explained
The
human genome is the
genome of
Homo sapiens, which is stored on 23 chromosome pairs. Twenty-two of these are
autosomal chromosome pairs, while the remaining pair is
sex-determining. The
haploid human genome occupies a total of just over 3 billion
DNA base pairs and has a data size of approximately 750
Megabytes, which is slightly larger than the capacity of a standard
Compact Disc. The
Human Genome Project produced a reference sequence of the
euchromatic human genome, which is used worldwide in
biomedical sciences.
The haploid human genome contains an estimated 20,000–25,000
protein-coding genes, far fewer than had been expected before its sequencing. In fact, only about 1.5% of the genome codes for
proteins, while the rest is comprised of
RNA genes,
regulatory sequences,
introns and (controversially)
"junk" DNA.
Features
Chromosomes
There are 24 distinct human
chromosomes: 22
autosomal chromosomes, plus the
sex-determining X and
Y chromosomes. Chromosomes 1–22 are numbered roughly in order of decreasing size.
Somatic cells usually have 23 chromosome pairs: one copy of chromosomes 1–22 from each parent, plus an X chromosome from the mother, and either an X or Y chromosome from the father, for a total of 46.
Genes
There are estimated 20,000–25,000 human protein-coding
genes.
Surprisingly, the number of human genes seems to be less than a factor of two greater than that of many much simpler organisms, such as the
roundworm and the
fruit fly. However, human cells make extensive use of
alternative splicing to produce several different proteins from a single gene, and the human
proteome is thought to be much larger than those of the aforementioned organisms.
Most human genes have multiple
exons, and human
introns are frequently much longer than the flanking exons.
Human genes are distributed unevenly across the chromosomes. Each chromosome contains various gene-rich and gene-poor regions, which seem to be correlated with
chromosome bands and
GC-content. The significance of these nonrandom patterns of gene density isn't well understood. In addition to protein coding genes, the human genome contains thousands of
RNA genes, including
tRNA,
ribosomal RNA,
microRNA, and other non-coding RNA genes.
Regulatory sequences
The human genome has many different
regulatory sequences which are crucial to controlling
gene expression. These are typically short sequences that appear near or within genes. A systematic understanding of these regulatory sequences and how they together act as a
gene regulatory network is only beginning to emerge from computational, high-throughput expression and
comparative genomics studies.
Identification of regulatory sequences relies in part on evolutionary conservation. The evolutionary branch between the human and
mouse, for example, occurred 70–90 million years ago. So computer comparisons of gene sequences that identify conserved non-coding sequences will be an indication of their importance in duties such as gene regulation.
Another comparative genomic approach to locating regulatory sequences in humans is the gene sequencing of the
puffer fish. These vertebrates have essentially the same genes and regulatory gene sequences as humans, but with only one-eighth the "junk" DNA. The compact DNA sequence of the puffer fish makes it much easier to locate the regulatory genes.
Other DNA
Protein-coding sequences (specifically, coding
exons) comprise less than 1.5% of the human genome. which leads to the possibility that the resulting transcripts may have some unknown function. Also, the evolutionary conservation across the
mammalian genomes of much more sequence than can be explained by protein-coding regions indicates that many, and perhaps most, functional elements in the genome remain unknown. The investigation of the vast quantity of sequence information in the human genome whose function remains unknown is currently a major avenue of scientific inquiry.
Variation
Most studies of human genetic variation have focused on
single nucleotide polymorphisms (SNPs), which are substitutions in individual bases along a chromosome. Most analyses estimate that SNPs occur on average somewhere between every 1 in 100 and 1 in 1,000 base pairs in the
euchromatic human genome, although they don't occur at a uniform density. Thus follows the popular statement that "we are all, regardless of
race, genetically 99.9% the same", although this would be somewhat qualified by most geneticists. For example, a much larger fraction of the genome is now thought to be involved in
copy number variation. A large-scale collaborative effort to catalog SNP variations in the human genome is being undertaken by the
International HapMap Project.
The genomic loci and length of certain types of small
repetitive sequences are highly variable from person to person, which is the basis of
DNA fingerprinting and DNA
paternity testing technologies. The
heterochromatic portions of the human genome, which total several hundred million base pairs, are also thought to be quite variable within the human population (they are so repetitive and so long that they can't be accurately sequenced with current technology). These regions contain few genes, and it's unclear whether any significant
phenotypic effect results from typical variation in repeats or heterochromatin.
Most gross genomic mutations in
germ cells probably result in inviable embryos; however, a number of human diseases are related to large-scale genomic abnormalities.
Down syndrome,
Turner Syndrome, and a number of other diseases result from
nondisjunction of entire chromosomes.
Cancer cells frequently have
aneuploidy of chromosomes and chromosome arms, although a
cause and effect relationship between aneuploidy and cancer hasn't been established.
Genetic disorders
cystic fibrosis is caused by mutations in the CFTR gene, and is the most common recessive disorder in caucasian populations with over 1300 different mutations known. Disease-causing mutations in specific genes are usually severe in terms of gene function, and are fortunately rare, thus genetic disorders are similarly individually rare. However, since there are many genes that can vary to cause genetic disorders, in aggregate they comprise a significant component of known medical conditions, especially in pediatric medicine. Molecularly characterized genetic disorders are those for which the underlying causal gene has been identified, currently there are approximately 2200 such disorders annotated in the OMIM database,.
Studies of genetic disorders are often performed by means of family-based studies. In some instances population based approaches are employed, particularly in the case of so-called founder populations such as those in Finland, French-Canada, Utah, Sardinia, etc. Diagnosis and treatment of genetic disorders are usually performed by a
geneticist-physician trained in clinical/medical genetics. The results of the
Human Genome Project are likely to provide increased availability of
genetic testing for gene-related disorders, and eventually improved treatment. Parents can be screened for hereditary conditions and
counselled on the consequences, the probability it'll be inherited, and how to avoid or ameliorate it in their offspring.
As noted above, there are many different kinds of DNA sequence variation, ranging from complete extra or missing chromosomes down to single nucleotide changes. It is generally presumed that much naturally occurring genetic variation in human populations is phenotypically neutral, for example has little or no detectable effect on the physiology of the individual (although there may be fractional differences in fitness defined over evolutionary time frames). Genetic disorders can be caused by any or all known types of sequence variation. To molecularly characterize a new genetic disorder, it's necessary to establish a causal link between a particular genomic sequence variant and the clinical disease under investigation. Such studies constitute the realm of human molecular genetics.
With the advent of the Human Genome and
International HapMap Project, it has become feasible to explore subtle genetic influences on many common disease conditions such as diabetes, asthma, migraine, schizophrenia, etc. Although some causal links have been made between genomic sequence variants in particular genes and some of these diseases, often with much publicity in the general media, these are usually not considered to be genetic disorders
per se as their causes are complex, involving many different genetic and environmental factors. Thus there may be disagreement in particular cases whether a specific medical condition should be termed a genetic disorder.
Evolution
Comparative genomics studies of mammalian genomes suggest that approximately 5% of the human genome has been conserved by evolution since the divergence of those species approximately 200 million years ago, containing the vast majority of genes.
Humans have undergone an extraordinary loss of
olfactory receptor genes during our recent evolution, which explains our relatively crude sense of
smell compared to most other mammals. Evolutionary evidence suggests that the emergence of
color vision in humans and several other
primate species has diminished the need for the sense of smell.
Mitochondrial genome
The human
mitochondrial genome, while usually not included when referring to the "human genome", is of tremendous interest to geneticists, since it undoubtedly plays a role in
mitochondrial disease. It also sheds light on human evolution; for example, analysis of variation in the human mitochondrial genome has led to the postulation of a recent common ancestor for all humans on the maternal line of descent. (see
Mitochondrial Eve)
Due to the lack of a system for checking for copying errors, Mitochondrial DNA (mtDNA) has a more rapid rate of variation than nuclear DNA. This 20-fold increase in the mutation rate allows mtDNA to be used for more accurate tracing of maternal ancestry. Studies of mtDNA in populations have allowed ancient migration paths to be traced, such as the migration of
Native Americans from
Siberia or
Polynesians from southeastern
Asia. It has also been used to show that there's no trace of
Neanderthal DNA in the European gene mixture inherited through purely maternal lineage.
Epigenome
chromatin packaging,
histone modifications and
DNA methylation, are important in regulating gene expression, genome replication and other cellular processes.
These "epigenetic" features are thought to be involved in cancer and other abnormalities, and some may be heritable across generations.
Further Information
Get more info on 'Human Genome'.
|
External Link Exchanges
Do you know how hard it is to get a link from a large encyclopaedia? Well we're different and will prove it. To get a link from us just add the following HTML to your site on a relevant page:
<a href="http://human_genome.totallyexplained.com">Human genome Totally Explained</a>
Then simply click through this link from your web page. Our crawlers will verify your link, extract the title of your web page and instantly add a link back to it. If you like you can remove the words Totally Explained and embed the link in article text.
As long as your link remains in place, we'll keep our link to you right here. Please play fair - our crawlers are watching. Your site must be closely related to this one's topic. Any kind of spamming, dubious practises or removing the link will result in your link from us being dropped and, potentially, your whole site being banned. |